Concepedia

Concept

data augmentation

Parents

7.2K

Publications

637K

Citations

25.5K

Authors

3.9K

Institutions

About

Data augmentation is a methodological approach in machine learning and data science that involves artificially increasing the size and diversity of a training dataset. This is achieved by applying various transformations or modifications to the original data instances, generating new, synthetic examples while preserving essential characteristics relevant to the learning task. The concept investigates techniques for generating valid variations of existing data to improve model generalization, robustness, and performance, particularly in scenarios with limited data availability or to mitigate overfitting by exposing models to a wider range of potential inputs. Its significance lies in being a fundamental strategy for enhancing the training of complex models, especially deep neural networks, across domains such as computer vision, natural language processing, and audio processing.

Top Authors

Rankings shown are based on concept H-Index.

JS

Korea Advanced Institute of Science and Technology

ED

Google (United States)

QV

Google (United States)

ZW

The University of Texas at Austin

YY

University of Technology Sydney

Top Institutions

Rankings shown are based on concept H-Index.

Google (United States)

Mountain View, United States

Tsinghua University

Beijing, China